Hyperparameter Auto-Tuning in Self-Supervised Robotic Learning
نویسندگان
چکیده
Policy optimization in reinforcement learning requires the selection of numerous hyperparameters across different environments. Fixing them incorrectly may negatively impact performance leading notably to insufficient or redundant learning. Insufficient (due convergence local optima) results under-performing policies whilst wastes time and resources. The effects are further exacerbated when using single solve multi-task problems. Observing that Evidence Lower Bound (ELBO) used Variational Auto-Encoders correlates with diversity image samples, we propose an auto-tuning technique based on ELBO for self-supervised Our approach can auto-tune three hyperparameters: replay buffer size, number policy gradient updates during each epoch, exploration steps epoch. We use a state-of-the-art robot framework (Reinforcement Learning Imagined Goals (RIG) Soft Actor-Critic) as baseline experimental verification. Experiments show our method online yields best at fraction computational Code, video, appendix simulated real-robot experiments be found project page www.JuanRojas.net/autotune.
منابع مشابه
Auto-tuning PID Controller for Robotic Manipulators
This paper suggests an auto-tuning method of PID trajectory tracking controller for robotic manipulators. In general, the PID trajectory tracking controller for mechanical systems shows the performance limitation. Since the control system including performance limitation can not have equilibrium points, we define newly the quasi-equilibrium region as an alternative for equilibrium point. Also, ...
متن کاملHyperparameter Learning for Graph Based Semi-supervised Learning Algorithms
Semi-supervised learning algorithms have been successfully applied in many applications with scarce labeled data, by utilizing the unlabeled data. One important category is graph based semi-supervised learning algorithms, for which the performance depends considerably on the quality of the graph, or its hyperparameters. In this paper, we deal with the less explored problem of learning the graph...
متن کاملParameter Auto-tuning Method Based on Self-learning Algorithm
The central air condition system is a complex system. Aimed at the puzzle of optimal status adjusting by once setting parameter of fuzzy PID, the paper proposed a sort of parameter auto-tuning method of fuzzy-PID based on self-learning algorithm. It adopted parameter autotuning technique to adjust the PID parameters in real time so as to ensure good quality of control system. It combined fuzzy ...
متن کاملCollaborative hyperparameter tuning
Hyperparameter learning has traditionally been a manual task because of the limited number of trials. Today’s computing infrastructures allow bigger evaluation budgets, thus opening the way for algorithmic approaches. Recently, surrogate-based optimization was successfully applied to hyperparameter learning for deep belief networks and to WEKA classifiers. The methods combined brute force compu...
متن کاملThreshold Auto-Tuning Metric Learning
It has been reported repeatedly that discriminative learning of distance metric boosts the pattern recognition performance. A weak point of ITML-based methods is that the distance threshold for similarity/dissimilarity constraints must be determined manually and it is sensitive to generalization performance, although the ITML-based methods enjoy an advantage that the Bregman projection framewor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE robotics and automation letters
سال: 2021
ISSN: ['2377-3766']
DOI: https://doi.org/10.1109/lra.2021.3064509